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SUMMARY 


Despite  the  great  progress  in  devising  technological  aids 
to  decision  makers,  most  decisions  are  largely  or  entirely  the 
product  of  intuitive,  psychological  processes.  Many  times, 
aids  are  not  available  (e.g. ,  under  field  conditions);  at  other 
times,  too  many  decisions  have  to  be  made  to  rely  on  more  than 
educated  intuition  (e.g.,  those  made  by  an  officer  or  executive 
during  a  routine  day) ;  even  when  the  stakes  involved  clearly 
justify  investment  in  aids,  time  pressures  may  make  that 
impossible  (e.g.,  crisis  situations).  Moreover,  the  use  of 
formal  aids  has  a  strong  judgmental  component. 

If  these  psychological  processes  are  to  be  improved,  they 
must  first  be  understood.  For  many  years,  the  dominant  model 
for  describing  many  kinds  of  decisions  has  been  a  weighted 
average  model.  In  this  view,  people  make  decisions  on  the 
basis  of  arithmetic  calculations  that  systematically  consider 
the  magnitude  of  all  possible  consequences  of  a  decision  and 
the  chances  of  achieving  them. 

Reviewing  many  research  results,  the  present  report 
concludes:  (a)  the  ability  of  the  weighted  average  models  to 

predict  the  decisions  that  people  make  varies  from  situation 
to  situation;  (b)  even  where  they  are  successful  in  predicting 
decisions,  these  models  may  not  do  a  very  good  job  of  describing 
the  psychological  processes  that  determined  them.  That  is, 
their  predictive  accuracy  is  in  part  an  artifact  of  the  research 
techniques  used. 

Without  an  understanding  of  how  people  make  decisions, 
it  is  difficult  to  devise  ways  of  enabling  them  to  make  better 


decisions.  The  report  concludes  with  several  alternative 
viewpoints  on  how  people  make  decisions  with  the  ultimate  aim 
of  devising  decision  aids  compatible  with  decision  makers' 
natural  modes  of  thought. 
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A  simple  and  comprehensive  rule  for  making  decisions  is  the 
following.  List  all  feasible  courses  of  action.  For  each  ac¬ 
tion,  enumerate  all  possible  consequences.  For  each  consequence, 
assess  the  attractiveness  or  aversiveness  of  its  occurrence,  as 
well  as  the  probability  that  i.t  will  be  incurred  should  the  ac¬ 
tion  be  taken.  Compute  the  expected  worth  of  each  consequence 
by  multiplying  its  worth  by  its  probability  of  occurrence.  The 
expected  worth  of  an  action  is  the  sum  of  the  expected  worths 
of  all  possible  consequences.  Once  the  calculations  are  comple¬ 
ted,  choose  the  action  with  the  greatest  expected  worth. 

Studying  the  prescriptive  and  descriptive  validity  of  this 
model  is  the  preoccupation  of  researchers  in  a  diffuse  field 
known  as  behavioral  decision  theory  (BDT) .  Some  look  at  the 
model  as  a  whole,  others  concentrate  on  its  components  (e.g., 
assessments  of  worth  or  probability) ,  and  still  others  concern 
themselves  with  special  cases.  The  most  general  form  is  known 
as  the  subjective  expected  utility  (SEU)  model.  In  it,  proba¬ 
bilities  are  treated  as  being  subjective  whereas  worth  is  ex¬ 
pressed  in  utility,  a  generalized  measure  xrt  desirability.  The 
S  is  dropped  if  probabilities  are  considered  to  be  "objective" 

(a  distinction  clarified  below) .  The  U,  for  utility,  becomes 
a  V,  for  value,  if  an  absolute  standard  of  worth,  like  dollars, 
is  used.  The  E  is  dropped  if  probabilities  are  ignored,  i.e., 
if  there  is  no  element  of  expectancy  and  receipt  of  all  conse¬ 
quences  is  viewed  as  a  certainty.  Most  of  the  expectancy-value 
(E-V)  models  described  elsewhere  in  this  book  would  in  this  jar¬ 
gon  be  called  either  EU  or  SEU  models;  they  use  subjective  judg¬ 
ments  of  worth  and  consider  uncertainty,  but  take  no  position 
on  the  meaning  of  probability. 

The  essay  that  follows  traces  the  history  of  SEU  research. 
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with  particular  emphasis  on  its  varied  intellectual  roots  and 
the  insights  and  blind  spots  they  have  entailed.  Like  any  other 
discipline,  its  progress  has  been  speckled  by  fits,  starts  and 
occasional  extended  pursuit  of  deadends.  These  are  discussed 
in  moderately  blunt  terms  with  an  eye  to  highlighting  the  hard- 
earned  lessons  that  seem  to  be  most  relevant  to  expectancy- 
value  (E-V)  research.  A  final  section  considers  less  obliquely 
the  question  of  what  the  two  approaches,  E-V  and  BDT,  may  have 
to  learn  from  one  another.  To  presage  that  discussion,  the  great 
disadvantage  lies  in  the  complexity  of  those  settings;  their 
great  disadvantage  lies  in  the  complexity  of  those  settings. 

Many  of  the  conclusions  of  behavioral  decision  theory  regarding 
the  intricacies  of  decision  behavior  and  its  study  were  slowly 
and  painstakingly  discovered  and  documented  in  highly  constrained 
experimental  stituations.  Such  subtle  signals  might  never  have 
been  detected  in  the  flesh.  Conversely,  field  research  of  the 
E-V  type  reminds  us  of  the  key  variables  that  should  be  presen¬ 
ted  in  laboratory  work. 

FORMAL  MODELS  OF  DECISION  MAKING 

Attempts  to  describe  how  people  make  decisions  took  their 
initial  marching  orders  from  prescriptive  models,  telling  how 
people  should  make  decisions.  Variants  of  the  SEU  model,  as 
elaborated  by  statistical  and  Bayesian  decision  theory,  were 
the  most  prominent.  These  models  came  from  the  vicinity  of  the 
intersection  of  economics  and  philosophy  and,  indeed,  the  first 
experimental  studies  of  decision  making  were  conducted  by  members 
of  those  professions.  Two  essays  by  Ward  Edwards  (1954,  1961) 
provide  comprehensive  and  stimulating  introductions  to  initial 
stages  of  this  field. 

The  appeal  of  the  models  was  obvious.  They  reduced  the 
universe  of  decisions  to  a  common  set  of  primitives  (options. 
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probabilities,  utilities)  about  which  one  could  hope  to  derive 
universal  truths  (e.g.,  people  exaggerate  low  probabilities)  in 
due  time.  To  some  extent,  this  promise  has  been  borne  out  by 
reality.  Three  hundred  articles  in  behavioral  decision  theory 
written  yearly  probably  produce  more  cumulative  knowledge  than 
the  three  thousand  publications  in  personality  research  (Gold¬ 
berg,  1974)  .  A  second  enticement  was  the  presence  of  an  axioma¬ 
tic  basis  for  these  models.  Any  individual  who  subscribes  to  a 
small  set  of  reasonable  rules  (e.g.,  transitivity)  should  behave 
in  accordance  with  the  model.  Prescriptive  reasonableness  is 
presumably  a  necessary  condition  for  the  descriptive  validity  of 
a  law  of  conscious  behavior.  In  mainstream  American  economics, 
the  idea  that  people  maximize  V  or  U  or  EV  or  SEU  or  multiattri¬ 
bute  SEU  has  the  status  of  a  meta-theory.  It  goes  without  saying 
that  such  maximization  is  what  people  try  and  manage  to  do.  The 
goal  of  the  theorist  is  not  to  test  this  assumption,  but  to  divine 
just  what  it  is  that  people  are  maximizing.  Like  other  meta¬ 
theories,  SEU  is  not  falsifiable  and  thus  not  a  theory  at  all 
from  this  perspective.  With  sufficient  ingenuity,  one  can  al¬ 
ways  find  something  that  a  particular  decision  maker  has  maxi¬ 
mized  in  a  particular  situation. 

The  story  of  behavioral  decision  theory  has  been  the  growing 
realization  that  SEU  often  does  not  describe  the  decision-making 
process  either  as  it  is  designed  or  executed.  The  dramatic  ten¬ 
sion  has  been  provided  by  SEU's  remarkable  ability  to  hang  on 
despite  mounting  doubts  about  its  descriptive  competence.  The 
climax  of  this  yet- incomplete  tale  seems  to  have  something  to  do 
with  realizing  that  a  clearly  erroneous  model  still  may  serve 
some  useful  role.  The  denouement  (or  perhaps  sequel)  may  involve 
elaborating  and  circumscribing  that  role.^ 

We  believe  that  key  clues  to  the  future  of  the  SEU  model 
(and  its  E-V  counterpart)  can  be  found  by  going  back  and  under- 
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standing  how  it  has  managed  to  hang  on  despite  decades  of  wither¬ 
ing  attack.  The  retrospective  look  that  follows  suggests  the 
secret  to  its  longevity  lies  in  the  fact  that  the  model  (a)  ex¬ 
presses  some  fundamental  truth  in  some  situations,  (b)  has  a 
remarkable  ability  to  be  close  to  the  mark  in  situations  where 
its  underlying  logic  is  clearly  inappropriate  and  (c)  has  propo¬ 
nents  of  heroic  tenacity. 

Model  Fitting 

One  way  to  use  the  SEU  model  is  in  explaining  past  decisions. 
A  set  of  similar  decisions  is  collected  and  the  researcher  sets 
about  identifying  a  set  of  predictors  (probabilities  and  utili¬ 
ties)  which,  when  combined  according  to  the  SEU  rule  will  account 
for  those  decisions.  This  search  may  involve  not  only  trying 
different  predictors,  but  also  different  ways  of  measuring  and 
transforming  those  predictors. 

Underlying  this  approach  is  a  fairly  reasonable  research 
strategy.  It  is  extraordinarily  difficult  to  study  simultane¬ 
ously  the  form  of  the  decision  models  that  people  use  and  the 
substantive  considerations  incorporated  in  them;  therefore,  let 
us  assume  the  truth  of  this  general  model  and  devote  our  efforts 
to  seeing  what  inputs  will  make  it  work.  Specifically,  assume 
that  people  use  the  SEU  model  and  find  what  their  probabilities 
and  utilities  are.  With  such  a  strategy,  it  is  hard  to  lose  in 
the  short  run.  The  clever  researcher  can  usually  find  some  set 
of  values  in  a  particular  situation  that  will  give  good  enough 
predictions  or  pcstdictions  to  maintain  faith  in  the  model. 

Any  descriptive  failure  by  an  SEU  model  can  be  explained 
away  as  a  failure  in  measurement  rather  than  the  use  of  the  wrong 
combination  rule.  The  wrong  consequences  were  included;  their 
worth  was  assessed  in  an  unreliable  or  confusing  manner;  uncer¬ 
tainties  were  ignored  or  elicited  improperly  and  so  on.  Some  of 
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the  more  imaginative  fudge  factors  fall  into  the  categories  of 
transaction  costs.  Decision  making  itself  is  seen  as  incurring 
such  costs  and  the  process  is  managed  so  as  to  keep  those  costs 
down.  Thus  a  decision  that  seems  to  select  a  suboptimal  alterna¬ 
tive  can  be  reinterpreted  as  the  result  of  truncating  or  simpli¬ 
fying  the  model  so  as  to  minimize  transaction  costs."^ 

The  complementary  research  strategy  is  to  presume  that  one 
knows  the  probabilities  and  utilities  to  which  people  attend 
and  then  experiment  with  different  combination  rules  until  one  is 
found  that  works.  The  price  one  pays  for  strict  adherence  to 
either  strategy  is  that  as  the  number  of  predictors  (different 
variables,  measurement  techniques,  data  transformations,  etc.) 
increases,  the  model  becomes  increasingly  vague  and  imprecise. 
Given  a  sufficiently  large  set  of  probabilities  and  utilities, 
one  can,  of  course,  devise  a  rule  "predicting"  past  decisions  to 
any  desired  level  of  proficiency.  In  regression  terms,  by  ex¬ 
panding  the  set  of  independent  variables  one  can  always  find  a 
set  of  predictors  (or  even  one  predictor)  with  any  desired  cor¬ 
relation  with  the  independent  variable.  The  price  one  pays  for 
overfitting  is  shrinkage,  failure  of  the  rule  to  "work"  on  a 
new  sample  of  cases.  With  well-defined  problems,  the  predictor/ 
sample  ratio  allows  one  to  "correct  for  shrinkage"  and  estimate 
the  predictive  validity  of  an  SEU  model  for  future  decisions. 

Such  corrections  are  impossible  when  it  is  unclear  just  how  many 
variants  of  the  model  have  been  tried  and  discarded  before  the 
best-fitting  one  was  identified.  Thus,  when  an  SEU  model  "works" 
on  a  set  of  data,  it  is  difficult  to  know  what  that  proves. 

Have  we  proven  that  the  decision  makers  we  are  studying  are  using 
SEU,  and  here  are  their  weights?  Or  has  our  fishing  expedition 
finally  stumbled  upon  an  SEU-like  model  which  mimics  their  actual 
behavior? 


Simple  Models  or  Simple  Processes? 

Growing  unease  with  the  elusiveness  of  the  proper  SEU  model 
was  compounded  by  results  emerging  from  the  study  of  clinical 
judgment,  another  intellectual  tradition  upon  which  behavioral 
decision  theory  draws.  Clinical  judgment  is  exercised  by  a 
radiologist  who  sorts  X  rays  of  ulcers  into  "benign"  and  "malig¬ 
nant,"  by  a  personnel  officer  who  chooses  the  best  applicant 
from  a  set  of  candidates,  by  a  crisis  center  counselor  who  decides 
which  callers  threatening  suicide  are  serious .  In  each  of  these 
examples,  the  diagnosis  involves  making  a  decision  on  the  basis 
of  a  set  of  cues  or  attributes.  When,  as  in  these  examples, 
the  decision  is  repetitive  and  all  cases  can  be  characterized 
by  the  same  cues,  it  is  possible  to  model  the  judges'  decision¬ 
making  policy  statistically.  Following  the  logic  of  the  SEU 
modeling  described  in  the  previous  section,  a  set  of  decisions 
are  gathered  and  multiple  regression  is  used  to  determine  the 
cue-weighting  scheme  that  best  accounts  for  the  decisions  actu¬ 
ally  made  by  the  judge. 

Two  decades  of  such  pol icy- capturing  studies  persistently 
produced  a  disturbing  pair  of  conclusions:  simple  linear  models, 
using  a  weighted  sum  of  the  cues,  did  an  excellent  job  of  post- 
dieting  judges'  decisions,  despite  the  judges'  claim  that  they 
were  using  much  more  complicated  strategies  (Goldberg,  1968,  1970; 
Slovic  &  Lichtenstein,  1971) .  A  commonly  asserted  form  of  com¬ 
plexity  is  called  "configural"  judgment,  in  which  the  diagnostic 
meaning  of  one  cue  depends  upon  the  meaning  of  other  cues  (e.g., 
"that  tone  of  voice  makes  me  think  'not  suicidal'  unless  the  call 
comes  at  mid-day") . 

Two  reasons  for  this  contrast  have  emerged,  each  with  nega¬ 
tive  implications  for  the  descriptive  validity  of  SEU.  One  reason, 
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fed  by  other  developments  in  cognitive  psychology  (e.g..  Miller, 
1956;  Simon,  1957),  involved  the  growing  realization  that  com¬ 
bining  enormous  amounts  of  information  in  one's  head  overwhelms 
the  computational  capacity  of  anyone  but  an  idiot  savant.  A 
judge  trying  to  implement  a  complex  strategy  simply  would  not 
be  able  to  do  so  with  great  consistency.  Indeed,  it  is  diffi¬ 
cult  to  learn  and  use  even  a  non-conf igural,  weighted-sum 
decision  rule  when  there  are  many  cues  or  unusual  relationships 
between  the  cues  and  predicted  variable  (Slovic,  1974)  .  Since 
the  SEU  model  is  such  a  weighted-sum  rule,  these  results  suggest 
that  people  could  not  use  it  even  if  they  tried. 

The  second  realization  that  emerged  from  clinical  judgment 
research  is  that  simple  linear  models  are  extraordinarily  power¬ 
ful  predictors.  If  one  can  identify  and  reliably  measure  the 
attributes  relevant  to  a  decision  maker,  one  can  mimic  his  or 
her  decisions  to  a  large  degree  with  simple  models  bearing  no 
resemblance  to  the  decision  makers'  cognitive  processes.  That 
is ,  one  can  misspecify  weights  and  even  combination  rules  and 
still  do  a  pretty  good  job  of  predicting  decisions  under  very 
general  conditions  (Dawes,  1979). 

This  discovery  proves  devastating  from  two  perspectives. 

One  is  that  whatever  people  are  doing  will  look  like  the  appli¬ 
cation  of  an  additive  linear  model  (like  SEU  or  E-V) .  In  Hoff¬ 
man's  (1960)  term,  such  models  are  paramorphic  in  that  they  can 
replicate  the  input-output  relations  of  the  phenomena  they  are 
meant  to  describe  without  any  guarantee  of  fidelity  to  the  under¬ 
lying  processes.  Thus,  even  if  such  a  model  predicts  subjects' 
behavior,  one  cannot  be  certain  that  they  are  actually  using 
such  a  decision  rule. 

Secondly,  one  cannot  take  the  results  of  attempts  to  charac¬ 
terize  decision  makers'  specific  policies  seriously.  If  one 


7 


assumes  that  people  are  using  an  SEU  or  other  additive  linear 
model,  their  personal  decision  model  could  seemingly  be  cap¬ 
tured  by  finding  the  weighting  schemes  that  best  predict  their 
actual  behavior.  Unfortunately,  the  best  predicting  weights 
determined,  say,  by  standard  regression  procedures,  become  in¬ 
creasingly  unstable  and  uninterpretable  when,  as  is  usually  the 
case,  there  is  any  dependence  between  the  cues,  or  multicollin- 
earity.  Thus,  even  positing  the  (arguable)  accuracy  of  the  ad¬ 
ditive  linear  model,  it  is  hard  to  tell  what  the  judge  is  up 
to,  even  when  behavior  can  be  predicted  quite  well. 

A  possible  solution  to  this  problem  is  to  create  stimuli 
or  tasks  with  no  dependence  between  attributes.  Slovic  (1969) , 
for  example,  used  a  factorial  design  to  describe  stocks  that 
were  evaluated  by  experienced  investors.  The  variance  explained 
by  each  dimension  (or  attribute)  could  be  used  to  determine  its 
importance  in  an  additive  linear  decision  model.  A  drawback 
to  this  solution  is  that  it  goes  against  another  of  the  intel¬ 
lectual  roots  of  behavioral  decision  theiry,  Brunswik's  prob¬ 
abilistic  functionalism  (1952;  Hammond,  1966)  .  From  this  per¬ 
spective,  the  study  of  behavior  involves  understanding  how  people 
adapt  to  an  uncertain  or  probabilistic  world.  Central  to  their 
adaptation  is  learning  the  natural  relationships  between  cues. 

A  research  design  that  destroys  these  relationships  (e.g.,  the 
factorial  one  described  above)  lacks  ecological  validity.  When 
confronted  with  such  an  unnatural  stimulus  environment,  the 
individual  can  only  effect  some  sort  of  (meaningless?)  adapta¬ 
tion  of  natural  behavior  to  this  unique  situation.  That  compro¬ 
mise  may  be  distinctly  unenlightening  about  real-world  behavior, 
the  world  which  E-V  researchers  wish  to  study. 

The  cumulative  effect  of  these  problems  was  to  cast  further 
doubt  on  the  usefulness  of  research  either  testing  the  validity 
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of  the  SEU  model  or  attempting  to  explicate  its  usage  through 
policy  capturing.  Despite  the  predictive  validity  of  these 
attempts,  they  seemed  to  yield  little  trustworthy  knowledge 
about  how  people  actually  make  decisions.  The  response  of  the 
research  community  was  a  parting  of  ways  between  two  camps  which 
might  be  described  as  involving  cognitive  applied  and  applied 
cognitive  psychologists  (Baddeley,  1977;  Wright,  1978).  The 
latter  were  interested  in  how  people  think  in  the  general  ap¬ 
plied  context  of  decision  making;  their  progress  is  described 
in  subsequent  sections . 

The  former  were  interested  in  specific  applied  problems 
whose  locus  was,  at  least  in  part,  in  cognitive  decision  making. 
For  them,  the  predictive  power  of  these  models  provided  a  highly 
useful  tool.  Often  one  doesn't  care  how  decisions  are  made,  as 
long  as  one  can  predict  their  outcome.  If  it  works,  a  para- 
morphic  model  may  be  good  enough  for  designing  an  effective 
marketing  campaign  or  remuneration  scheme.  Such  circumscribed 
successes  are  all  that  many  applied  E-V  researchers  need.  Al¬ 
though  even  there,  some  understanding  of  how  people  process 
information  might  help  present  options  in  the  most  accessible 
manner  possible. 

At  times,  a  valid  predictive  model  might  be  used  to  replace 
clinical  judges  by  formulae  modeling  their  behavior.  Called 
"bootstrapping,"  this  approach  calls  for  having  decision  makers 
identify  the  variables  upon  which  they  base  their  decisions  and 
then  "capture"  the  policy  they  use  with  a  set  of  trial  cases. 
That  policy,  as  embodied  in  a  formula,  is  then  used  in  place  of 
the  judges.  Even  if  it  does  not  embody  their  thought  processes, 
it  may  mimic  their  decisions  with  greater  reliability  (and 
hence  validity)  than  the  judges  themselves.  Formulae  never 
have  off-days  or  suffer  from  fatigue  or  distraction  by  irrele¬ 
vant  cues  (Slovic  &  Lichtenstein,  1971) .  Clearly,  people  know 
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more  than  is  included  in  such  formulae.  But  there  is  no  evi¬ 
dence  to  date  indicating  that  they  can  convert  this  sensitivity 
to  the  richness  of  life  situations  into  superior  predictions. 

Empirically  discovering  an  analytical  result  by  Wilks 
(1938) ,  Dawes  and  Corrigan  (1974)  showed  that  considerable 
predictive  success  is  possible  without  almost  any  modeling  at 
all.  All  one  has  to  do  is  to  identify  the  variables  (or  attri¬ 
butes)  to  which  a  decision  maker  attends  and  decide  whether 
they  are  positively  or  negatively  related  to  the  decision  cri¬ 
terion.  If  these  variables  are  expressed  in  standard  units, 
they  can  be  given  unit  weights  (+1  or  -1,  as  appropriate) .  Such 
a  unit  weighted  model  will,  under  very  general  conditions,  pre¬ 
dict  decisions  as  well  as  a  weighted-sum  model  using  regression 
weights . 

Thus,  a  simple  substantive  theory  indicating  what  variables 
people  care  about  when  making  decisions  may  be  all  ohe  needs  to 
make  pretty  good  predictions  of  their  behavior.  If  they  like 
more  of  good  things  and  less  of  bad  things,  just  count  up  the 
number  of  good  things  an  option  leads  to  and  subtract  the  number 
of  bad  things  and  you  have  a  good  idea  of  how  favorably  it  will 
be  viewed.  The  predictive  power  of  such  a  simple  model  provides 
a  base  line  against  which  more  sophisticated  models  could  be 
compared . 

Obviously,  some  goods  (positive  attributes)  are  more  impor¬ 
tant  than  others.  Therefore,  a  model  using  importance  weights 
should,  in  principle,  predict  better  than  one  using  unit  weights. 
Similarly,  goods  obtained  with  high  probability  should  be  valued 
more  than  those  obtained  with  low  probability.  Therefore,  a 
model  using  probability  weights  should  in  principle  predict  better 
than  one  using  unit  weights.  However,  any  unreliability  or 
misspecif ication  of  those  weights,  due  to  poor  procedure  or  multi- 
collinearity ,  reduces  their  contribution  very  rapidly.  In  the 
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extreme,  models  using  poorly  conceived  or  executed  weighting 
schemes  may  succeed  in  spite  of  rather  than  because  of  their 
differences  from  the  unit-weight  model. 


All  of  these  discoveries  from  the  area  of  clinical  judgment 
have  rather  chilling  implications  for  the  use  of  modeling  tech¬ 
niques  for  determining  whether  people  use  E-V  decision-making 
models  and,  if  so,  how.  In  short,  the  success  of  an  E-V-type 
model  in  predicting  behavior  proves  very  little  besides  member¬ 
ship  of  the  model  in  the  family  of  powerful  linear  models. 

Neither  the  degree  of  success,  nor  the  specific  importance  weights 
used  allow  unambiguous  interpretation. 

Divide  and  Conquer 


Feeling  that  the  SEU  model  was  impregnable  and  impenetrable 
when  dealt  with  as  a  whole,  the  applied  cognitive  psychologists 
among  behavioral  decision  theorists  have  largely  turned  to  an 
examination  of  the  model's  parts  and  their  validity.  Do  people 
accept  the  axioms  upon  which  the  model  is  based?  Are  they  capa¬ 
ble  of  providing  the  inputs  that  it  requires?  Are  their  intui¬ 
tions  attuned  to  the  kind  of  thinking  embodied  in  SEU?  In  simpli¬ 
fied  situations,  which  relieve  their  computational  load,  do 
people  exhibit  SEU-like  behavior?  Are  they  sensitive  to  factors 
that  have  no  representation  in  SEU? 

PRESCRIPTIVE  VALIDITY 

One  great  attraction  of  SEU  approaches  is  that  they  repre¬ 
sent  reasonable  decision  rules,  ways  in  which  people  might  want 
to  make  decisions;  such  an  attractive  prescriptive  model  would 
be  a  sensible  point  of  departure  for  developing  a  valid  descrip¬ 
tive  model.  The  form  of  the  SEU  model  (Ep^u^)  states  that  ac¬ 
tions  become  more  attractive  as  their  good  consequences  become 


11 


more  appealing  (u^  increases)  and  more  likely  (p^  increases) 
or  as  their  less  appealing  consequences  become  less  likely. 

Since  the  expected  utilities  of  various  possible  consequences 
are  added,  low  or  negative  utility  associated  with  one  conse¬ 
quence  can,  in  principle,  be  compensated  for  by  sufficiently 
high  utility  on  others.  If  people  wish  to  have  their  decision 
making  guided  by  these  rules,  the  study  of  decision  making  be¬ 
comes  an  analysis  of  their  ability  to  carry  them  out. 

When  might  people  reject  these  rules? 

(a)  When  more  of  a  good  thing  is  not  better.  Such  situ¬ 
ations  can,  in  principle,  be  handled  by  having  a  non-linear 

or  even  non-monotonic  relationship  between  the  magnitude  of  a 
consequence  and  its  utility. 

(b)  When  the  appeal  of  a  positive  consequence  does  not 
increase  linearly  with  the  certainty  of  its  being  attained  (p^) . 
Atkinson  and  Feather  (1966)  and  others  have  shown  that  at  times, 
people  will  forego  an  increment  in  probability  of  success  in 
order  to  be  challenged  by  a  situation  or  to  have  some  suspense 
associated  with  its  outcome.  For  example,  one  might  prefer  a 
class  in  which  the  top  80%  receive  A's  to  one  in  which  A's 

are  given  to  everyone  automatically.  SEU  models  cannot  directly 
accommodate  assigning  intrinsic  values  to  probabilities  of 
success.  However,  it  is  possible  to  redefine  the  consequences 
so  as  to  save  the  model.  In  the  example,  the  "cinch  A"  and 
the  "earned  A"  are  different  consequences  whose  utilities  may 
differ  by  more  than  the  difference  in  probabilities  (.8  and 
1.0),  whose  values  are  treated  as  absolutes. 

Because  SEU  researchers  have  been  interested  in  developing 
a  general  descriptive  theory  of  decision  making,  there  have 
been  relatively  few  attempts  to  describe  the  substantive 
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situations  in  which  such  dependencies  arise.  One  does  not 
even  know  if  they  occur  commonly  or  merely  in  a  small,  but 
important  class  of  achievement-related. circumstances . 

One  interpretation  of  such  dependence  of  value  upon  prob¬ 
ability  is  as  a  reflection  of  risk  aversiveness.  Risk-averse 
individuals  attach  a  value  to  certainty  itself,  whereas  risk¬ 
seeking  individuals  do  not.  There  have  been  some  attempts  to 
study  individual  differences  in  propensity  to  take  risks,  how¬ 
ever,  these  have  foundered  on  the  apparent  absence  of  such 
differences  (e.g.,  Davidshofer,  1976;  Wright  &  Phillips,  in 
press) .  Poorly  understood  situational  variables  seem  to  over¬ 
whelm  individual  differences  in  determining  risk  aversion.  With¬ 
in  the  achievement  literature,  the  individual  difference  varia¬ 
ble  of  nAch  is  well  known.  It,  too,  however,  is  often  over- 
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whelmed  by  situational  manipulations  of  achievement  imagery. 

(c)  When  options  are  evaluated  by  non-compensatory  criteria. 
Two  possible  non-compensatory  rules  are  the  conjunctive  and  dis¬ 
junctive.  By  the  conjunctive  rule,  an  option  has  to  score  fairly 
high  on  each  consequence  to  be  considered.  For  example,  a  vaca¬ 
tion  option  must  be  reasonbly  priced,  available  when  needed, 
suitably  sunny  and  fairly  quiet.  If  an  option  failed  to  pass 
muster  on  any  one  of  these  attributes,  its  rating  on  the  others 
would  be  immaterial,  e.g.,  no  amount  of  sun  will  compensate  for 
a  lot  of  noise.  These  minimal  levels  are,  in  a  sense,  non-nego- 
tiable  demands.  According  to  the  disjunctive  rule,  an  option 
that  is  adequate  on  any  one  attribute  is  acceptable.  For 
example,  an  investment  opportunity  might  be  chosen  if  it  were 
good  enough  as  a  speculation,  tax  shelter  or  hedge  against 
inflation,  no  matter  how  badly  it  rated  on  the  other  dimensions. 
Investment  portfolios  often  include  varied  items  chosen  by  the 
same  disjunctive  rule.  Compound  strategies  are  also  possible, 
such  as  using  a  disjunctive  or  conjunctive  rule  to  reduce  a 
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large  set  of  options  to  a  smaller  one  to  which  a  compensatory 
rule  is  then  applied  (Svenson,  1979) . 


Either  partial  or  total  reliance  on  a  non-compensatory 
rule  could  spell  serious  difficulties  for  the  prescriptive  (and 
hence  descriptive)  validity  of  SEU  models.  Einhorn  (1971)  pro¬ 
vided  one  of  the  first  such  demonstrations,  although  the  power 
of  linear  models  is  such  that  even  where  they  were  inferior  to 
non-compensatory  models,  they  still  provided  relatively  good 
predictions.  Lichtenstein,  Slovic  and  Zink  (1969)  tried  to 
convince  people  to  abandon  non-compensatory  strategies  in  favor 
of  using  expected  value  as  a  decision  rule,  but  to  no  avail. 
Choice  theory  offers  a  variety  of  other  non-compensatory  deci¬ 
sion  rules  (e.g.,  minimax)  whose  use  has  been  observed  in  one 
situation  or  another  (e.g.,  Coombs,  Dawes  &  Tversky,  1970). 

Here,  too,  our  ability  to  characterize  decision  situations  and 
the  strategies  they  induce  is  limited.  Attempts  to  predict  the 
usage  and  non-usage  of  compensatory  strategies  have  a  distinctly 
ad  hoc  character. 

(d)  When  consequences  are  not  evaluated  independently. 
Literal  usage  of  an  SEU  model  requires  one  to  evaluate  each  con¬ 
sequence  attribute  of  each  option  by  itself  and  then  combine 
the  results.  In  some  situations,  however,  the  decision  maker 
might  want  to  evaluate  a  particular  consequence  differently 
depending  upon  the  value  of  other  consequences.  For  example, 
one  might  like  either  a  slightly  uncomfortable  chair  or  a  mod¬ 
erate  level  of  ambient  noise  when  deciding  where  to  sit  in  a 
lecture,  knowing  that  either  will  help  overcome  a  tendency 
to  fall  asleep.  However,  a  seat  that  is  both  noisy  and  hard 
will  be  undesirable  since  it  diverts  too  much  attention  to 
overcoming  discomfort.  In  other  words,  worth  is  an  interactive 
function  of  the  attributes. 

The  concern  of  E-V  researchers  over  the  effects  of  extrin- 
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sic  reinforcements  on  intrinsic  motivation  (Deci,  1975) 
reflects  another  such  interaction,  as  it  suggests  that  the 
value  of  intrinsic  rewards  may  be  affected  by  the  specific 
extrinsic  reward  received.  This  can,  in  principle,  be  handled 
in  SEU  models,  by  evaluating  the  conjunction  of  attributes 
rather  than  each  one  separately.  In  the  previous  example,  one 
would  evaluate  separately  a  noisy  and  soft  seat,  a  noisy  and 
hard  seat,  a  quiet  and  soft  seat  and  a  quiet  and  hard  seat. 

With  all  but  the  simplest  of  attribute-attribute  interactions, 
however,  this  would  prove  quite  laborious  in  practice. 

The  prevalence  of  such  interactive  evaluations  is  the  most 
extensively  studied  aspect  of  the  search  for  configural  judges 
mentioned  earlier.  To  repeat  the  results  of  that  search:  while 
clinical  judges  often  believe  that  they  are  using  configural 
rules,  evidence  of  consistent  conf igurality  is  meager.  However, 
it  is  still  unclear  whether  this  contradiction  reflects  the 
failure  of  these  judges'  introspection  (i.e.,  they  don't  know 
what  they're  doing)  or  the  power  of  additive  linear  models. 

(e)  When  options  are  not  evaluated  simultaneously.  Simon's 
(1957)  notion  of  "satisficing"  grew  out  of  observing  the  predic¬ 
tive  failures  of  the  SEU-like  models  of  classical  economics. 
Satisficers  look  for  decision  alternatives  that  are  good  enough. 
In  this  process,  they  may  use  one  of  the  non-compensatory  rules 
described  above  or  a  compensatory  rule,  looking  for  adequate 
overall  performance.  In  any  case,  their  search  terminates  when 
a  satisfactory  option  has  been  identified  and  evaluated.  This 
option  may,  however,  be  inferior  (in  an  SEU  sense)  to  other 
options  that  are  not  considered. 

An  alternative  approach  to  assessing  the  prescriptive 
reasonableness  of  the  SEU  model  is  to  see  whether  people  accept 


the  axioms  from  which  it  is  derived.  In  1969,  Tversky  found 
that  while  people  like  to  be  transitive  in  their  choices  (thus 
fulfilling  one  axiom) ,  it  is  possible  to  design  situations  in 
which  they  are  both  intransitive  and  unable  to  resolve  the 
inconsistency  in  their  choices.  He  speculated  that  marketing 
approaches  may  be  designed  to  exploit  such  difficulties.  Another 
demonstration  of  inconsistency  appeared  in  Zagorski  (1975) ,  who 
showed  people  pairs  of  gambles  (A,  B)  and  asked  them  to  judge 
the  amount  of  money  V(A-B)  that  would  induce  them  to  trade  the 
better  gamble  (A)  for  the  worse  gamble  (B) .  He  demonstrated 
that  one  can  construct  quadruples  of  gambles  A,  B,  C,  and  D 
such  that  V(A-B)  +  V(B-C)  +  V(A-D)  +  V(D-C) .  In  other  words, 
path  independence  was  violated.  The  difference  between  gambles 
A  and  C  depends  on  whether  the  intermediate  gamble  is  B  or  D. 

The  Allais  and  Ellsberg  paradoxes  (see  Edwards,  1954,  1961)  are 
two  demonstrations  of  people  rejecting  Savage’s  independence 
axiom,  according  to  which  preferences  between  alternatives  should 
be  independent  of  any  consequences  they  have  in  common. 

Until  recently,  however,  few  theorists  were  convinced  by 
these  examples.  By  way  of  counterattack,  MacCrimmon  (1968) 
showed  that  business  executives  who  violated  various  axioms 
could  easily  be  led,  via  discussion,  to  see  the  error  of  their 
ways.  However,  Slovic  and  Tversky  (1974)  challenged  MacCrim¬ 
mon  's  discussion  procedure  on  the  grounds  that  it  pressured  sub¬ 
jects  to  accept  the  axioms.  They  presented  subjects  with  argu¬ 
ments  for  and  against  the  independence  axiom  and  found  persis¬ 
tent  violations ,  even  after  the  axiom  was  presented  in  a  clear 
and  presumably  compelling  fashion.  Moskowitz  (1974)  used  a 
variety  of  problem  representations  (matrix  formats,  trees,  and 
verbal  presentations)  to  clarify  the  principle  and  maximize  its 
acceptability,  yet  still  found  that  the  independence  axiom  was 
rejected.  Even  MacCrimmon' s  faith  in  many  of  the  key  axioms 
has  been  shaken  by  recent  data,  leading  him  to  suggest  that 
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reevaluation  of  the  theory  is  in  order  (MacCrimmon  &  Larsson, 
1976)  . 

To  be  most  useful,  such  a  reevaluation  would  have  to  go 
beyond  demonstrating  that  violation  or  rejection  of  the  axioms 
is  possible.  It  would  have  to  give  some  guidelines  as  to  the 
prevalence  and  distribution  of  violations.  Are  they  just  con¬ 
cocted  curiosities?  Or  do  they  represent  modal  behavior  in  some 
important  realms? 

Kahneman  and  Tversky  (1979)  attempted  this  sort  of  reevalu¬ 
ation,  presenting  evidence  for  two  pervasive  violations  of  SEU 
theory.  One,  the  "certainty  effect,"  causes  consequences,  both 
positive  and  negative,  that  are  obtained  with  certainty  to  be 
given  more  weight  than  uncertain  consequences.  The  Allais 
paradox  may  be  due  to  this  effect.  The  second,  labeled  the 
"reference  effect,"  leads  people  to  evaluate  alternatives  rela¬ 
tive  to  a  reference  point  corresponding  to  their  present  status, 
expectation  or  adaptation  level  (Helson,  1947,  1959).  By 
altering  the  reference  point,  formally  equivalent  versions  of 
the  same  decision  problem  may  elicit  different  preferences. 

These  effects  pose  serious  problems  for  the  validity  (prescrip¬ 
tive  and  descriptive)  of  SEU  approaches.  The  power  of  Kahneman 
and  Tversky' s  theory  is  its  ability  to  predict  responses  to  de¬ 
cision  problems  posed  in  particular  ways  and,  in  particular,  to 
predict  behavior  that  is  contrary  to  predictions  of  the  SEU 
model.  It  needs,  however,  to  be  complemented  by  a  substantive 
theory  regarding  the  way  in  which  decision  questions  are  posed 
by  nature  and  interpreted  by  observers. 

AVAILABILITY  OF  INPUTS 

Let  us  restrict  our  attention  to  situations  in  which  people 
might  wish  to  engage  in  SEU- like  behavior.  Doing  so  means  pro- 
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viding  (and  eventually  integrating)  lists  of  feasible  options 
and  possible  consequences  along  with  assessments  of  their  prob¬ 
abilities  and  values.  One  needn't  perform  these  tasks  well 
in  order  to  engage  in  SEU-like  behavior.  Poor  performance  will, 
however,  lead  to  suboptimal  decisions.  As  the  lists  become 
more  and  more  incomplete  and  the  quality  of  the  assessments 
deteriorates,  the  resultant  decision  will  tend  to  deviate  from 
that  obtained  by  the  best  possible  usage  of  the  SEU  model. 

Thus  making  decisions  not  in  one's  own  best  interests  (as 
defined  by  the  "best  possible"  SEU  decision)  need  not  mean  that 
the  SEU  model  is  an  invalid  descriptor  of  what  people  are  doing. 
Nonetheless,  acute  inability  to  provide  the  inputs  required  to 
use  the  SEU  model  well  must  cast  some  doubts  on  the  extent  of 
its  usage.  At  some  point,  one  must  stop  and  ask,  would  people 
persist  in  doing  something  they  do  so  poorly?  To  take  an 
analogous  problem,  I  may  realize  the  essential  wisdom  of  using 
the  1040  Form  for  income  tax  and  itemizing  my  deductions  in  order 
to  reduce  my  obligation.  If,  however,  experience  shows  that  I 
typically  do  it  wrong  and  land  in  trouble,  I  may  resort  to  less 
complicated  decision  rules.  Whether  or  not  it  is  reasonable 
to  expect  people  to  persist  at  suboptimal  SEU  behavior,  rather 
than  opting  for  some  simpler  rule,  requires  consideration  of 
two  questions:  (a)  Just  how  good  or  bad  are  they?  and 
(b)  How  likely  are  they  to  learn  about  their  mistakes?  If 
people  both  could  not  perform  these  tasks  and  realized  their 
limitations,  the  SEU  model  would  be  a  less  likely  candidate  for 
how  people  try  to  make  decisions. 

What  else  could  people  do,  besides  trying  to  list  and  assess 
the  expected  consequences  of  all  courses  of  action  open  to  them? 
For  one,  they  could  try  not  to  think  at  all,  but  rely  on  non- 
analytical  decision  rules  like  "this  is  (most  like)  what  I've 
always  done"  or  "this  is  what  my  (most  expert)  friends  tell  me 
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to  do"  or  "this  is  what  everyone  else  is  doing."  Or  they  could 
refuse  to  make  one  final  decision,  preferring  to  muddle  through 
by  trial  and  error,  making  small  incremental  decisions  that  af¬ 
ford  an  opportunity  to  change  courses  if  things  aren't  going 
well . 


Relying  on  the  collective  wisdom  of  one's  own  experience 
(habit)  or  of  one's  peers'  experience  (tradition)  or  of  one's 
peers'  current  feelings  are  all  ways  of  externalizing  the  prob¬ 
lem.  Each  recruits  a  number  of  people  to  think  about  issues 
too  complex  and  poorly  understood  to  analyze  alone.  Both  common 
wisdom  and  such  ^u  courant  techniques  as  Delphi  advocate  such 
thought  sharing.  One  could,  of  course,  consult  these  sources 
for  opinions  that  would  eventually  be  used  in  a  personal  SEU 
model.  Or  one  could  reject  the  methodological  individualism 
of  SEU  decision  making  in  favor  of  whatever  conclusion  emerges 
from  these  chaotic  group  processes.  Such  reliance  on  group 
rather  than  individual  "rationality"  is  well  documented  in 
work  situations  (Salancik  &  Pfeffer,  1977,  1978;  White  & 

Mitchell,  1979) .  Perhaps  the  best  known  example  is  the  emerg¬ 
ence  of  group  production  norms  in  defiance  of  piece-work  pay 
schemes . ^ 

One  of  the  few  exercises  in  complex  analytic  decision  making 
imposed  upon  most  people  is  completion  of  income  tax  forms.  The 
success  of  H  &  R  Block  suggests  the  reluctance  of  the  rest  of 
us  to  face  the  challenge. 

Problem  Structuring 

Like  other  analytic  approaches,  SEU  begins  with  a  structur¬ 
ing  of  the  problem,  namely  listing  all  relevant  options  and 
consequences.  The  obvious  performance  standard  here  is  complete¬ 
ness.  A  modest  amount  of  systematic  and  anecdotal  evidence 
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suggests  that  people  have  difficulty  independently  producing 
complete  problem  representations.  When  asked  to  judge  complete¬ 
ness  of  problems  presented  by  others,  however,  they  do  not  seem 
to  be  very  sensitive  to  these  inadequacies.  What  is  out  of 
sight  is  effectively  out  of  mind  (Fischhoff,  Slovic  &  Lich¬ 
tenstein,  1977,  1978).  Thus  in  the  short  run  at  least,  people 
may  not  realize  this  limit  on  their  analytical  decision-making 
ability.  In  the  long  run,  omitting  options  and  consequences 
should  lead  to  poor  decisions.  However,  life  seldom  sends  us 
large  enough  batches  of  unambiguous  signals  to  make  it  clear 
just  how  good  our  decision  making  is  and  where  the  problems 
lie  (Fischhoff,  1975;  Einhorn  &  Hogarth,  1978)  .  While  unrecog¬ 
nized  incompleteness  seems  to  be  quite  general,  there  presumably 
are  decision  problems  whose  most  important  features  are  readily 
uncovered  by  people's  intuitive  procedures.  Identifying  that 
set  of  problems  would  require  the  sort  of  substantive  theory 
of  people  in  situations  which  has  interested  E-V  researchers. 

Assessing  Probabilities 

A  cornerstone  of  SEU  thinking  is  that  we  all  live  in  an 
uncertain  world.  According  to  the  Bayesian  or  subjectivist 
position  that  uncertainty  reflects  the  limits  of  our  understand¬ 
ing.  From  this  position,  all  statements  are  implicitly  or 
explicitly  qualified  by  our  degree  of  belief  in  them.  Degrees 
of  belief  are  numerically  expressed  in  subjective  probabilities, 
i.e.,  expressions  with  the  form  "My  personal  probability  that 
Statement  A  is  true  is  .XX".  Such  probabilities  are  entirely 
in  the  eye  of  the  beholder  and  are  not  properties  of  the  world. 

Two  observers  of  a  situation  can  in  principle  assign  different 
probabilities  of  being  true  to  the  same  statement  about  it,  either 
because  they  interpret  what  they  see  differently  or  have  differ¬ 
ent  background  information. 


In  fact,  the  whole  notion  of  "objective  probability”  is 
confused.  Assigning  any  probability  requires  some  interpreta¬ 
tion  of  the  observed  situation  and  that  act  of  interpretation 
imputes  a  subjective  element.  What  are  commonly  thought  of 
as  objective  probabilities  merely  refer  to  situations  in  which 
there  is  consensus  among  reasonable  observers  about  how  to 
interpret  existing  evidence.  The  probability  of  a  fair  coin 
falling  on  "heads"  on  its  next  flip  is  commonly  held  to  be  .5, 
but  that  is  only  because  observers  agree  on  how  to  evaluate 
past  performance  and  relate  it  to  subsequent  performance  (e.g., 
what  constitutes  an  adequate  series  of  independent,  equivalent 
trials) .  Bayesians  do  not  talk  about  estimating  probabilities 
which  implies  that  there  is  something  "out  there"  in  the  world 
to  be  appraised,  but  of  assessing  probabilities,  signifying  the 
evaluation  of  an  internal  state. 

If  probabilities  are  subjective,  does  it  make  any  sense 
to  consider  their  validity?  Individual  probabilities  can 
almost  never  be  evaluated  (except  in  the  case  where  one  says 
"There  is  absolutely  no  chance  of  this  statement  being  wrong" 
regarding  a  statement  that  is,  in  fact,  wrong). 

Sets  of  subjective  probabilities  may,  however,  be  evalu¬ 
ated  according  to  two  criteria.  One  is  consistency  with  the 
laws  of  probability  or  (internal)  coherence  (Kyburg  &  Smokier, 
1964)  .  For  example,  the  probabilities  assigned  to  a  statement 
and  its  complement  should  total  1.0.  The  second  is  "calibration," 
applicable  when  the  truth  of  statements  is  known  to  the  proba¬ 
bility  assessor.  The  well-calibrated  probability  assessor  should 
have  more  true  statements  associated  with  high  than  with  low 
probabilities.  Specifically,  XX%  of  the  statements  assigned 
probability  .XX  should  be  perceived  to  be  true.  That  is,  if  I 
am  well  calibrated,  70%  of  the  time  when  I  say  "there  is  a  .7 
chance  of  this  statement  being  true,"  it  should  be  true. 
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The  earliest  studies  of  probability  assessment  were  restric¬ 
ted  to  situations  in  which  consensus  could  be  reached  on  "objec¬ 
tive"  probability  values.  These  experiments  used  repetitive 
series  of  events  (like  drawing  red  or  blue  balls  from  an  urn) 
and  defined  probability  as  relative  frequency  of  occurrence. 
Research  showed  that  people  were  quite  good  at  appraising  such 
f requentistic  probabilities  (Peterson  &  Beach,  1967) . 

More  recently,  the  focus  of  research  has  turned  to  how 
people  assign  probabilities  to  statements,  including  unique 
events  (e.g..  Carter  will  be  re-elected),  where  relative  fre¬ 
quency  has  no  meaning.  This  shift  seems  to  reflect  a  growing 
acceptance  of  the  subjectivist  position,  a  feeling  that  one 
should  study  probabilities  in  their  most  general  form  and  a 
realization  that  relatively  few  important  events  in  people's 
lives  are  thought  of  in  terms  of  relative  frequency. 

Here,  performance  has  been  less  than  outstanding.  Experi¬ 
ments  using  a  variety  of  tasks,  response  modes  and  subjects 
have  shown  that  probability  assessments  tend  to  be  poorly  cali¬ 
brated  (Lichtenstein,  Fischhoff  &  Phillips,  1977).  The  most 
common  type  of  error  is  overconfidence,  i.e.,  people  think  that 
they  know  more  than  is  actually  the  case.  Where  observed,  this 
bias  seems  to  be  so  robust  that  people  are  willing  to  engage 
in  highly  disadvantageous  bets  based  upon  their  confidence 
judgments  (Fischhoff,  Slovic  &  Lichtenstein,  1977)  . 

Various  forms  of  incoherence  have  also  been  observed,  al¬ 
though  without  any  clear  indication  of  their  prevalence.  For 
example,  Wyer  (1974)  found  that  people  tend  to  exaggerate  the 
probability  of  the  conjunction  of  two  events,  relative  to  the 
probabilities  assigned  to  the  two  individually.  Kahneman  and 
Tversky  (1973)  found  that  rather  than  combining  background  (base 
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rate)  information  with  information  regarding  a  particular  case, 
people  tend  to  ignore  base-rate  probabilities.  Slovic,  Fisch- 
hoff  &  Lichtenstein  (1976)  demonstrated  situations  in  which  the 
judged  probability  of  compound  events  was  larger  than  the  prob¬ 
abilities  of  their  constituent  events. 

It  is  disturbing  that  P(A/0  B)  should  ever  be  greater  than 
P  (A)  or  P(B) ,  but  some  notion  of  prevalence  is  needed  to  decide 
whether  the  problem  is  so  bad  as  to  indicate  either  (a)  that 
people  would  realize  their  limits  and  avoid  probabilistic 
thinking  or  (b)  that  the  quality  of  people's  probabilistic 
thinking  is  so  poor  as  to  suggest  that  they  seldom  engage  in 
it.  Such  a  study  of  prevalence  and  extent  would  have  to  be 
accompanied  by  an  analysis  of  how  bad  performance  would  have 
to  be  for  people  to  notice.  In  this  context,  the  folklore  of 
survey  research  may  be  instructive.  Popular  wisdom  there 
holds  that  one  cannot  ask  for  numerical  probability  assess¬ 
ments  from  a  random  sample  of  the  population  without  getting 
disturbingly  high  rates  of  non-response  (i.e.,  over  20%). 
Regarding  detailed  verbal  statements  of  probability,  the  experi¬ 
ence  of  the  1977  Quality  of  Employment  Survey  (Quinn  &  Staines, 
1978)  is  instructive.  Pretest  work  resulted  in  probability 
assessments  being  gathered  on  a  response  scale  with  only  two 
alternatives:  "Yes,  it  is  likely"  and  "No,  it  is  not  likely." 

The  test-retest  reliability  of  more  detailed  verbal  probability 
assessment  has  been  found  at  times  to  be  surprisingly  low  (r  = 
.52-. 56  for  DeLeo  &  Pritchard,  1974;  Lied  &  Pritchard,  1976). 

On  the  other  hand,  people  do  seem  to  have  some  appreciation 
of  the  meaning  of  probabilistic  weather  forecasts  (Murphy, 
Lichtenstein,  Fischhoff  &  Winkler,  1979) .  Perhaps  they  feel 
more  comfortable  with  the  probability  of  rain  because  the  event 
is  highly  repetitive  and  their  task  is  passive  (listening) . 

Perhaps  the  most  disturbing  aspect  of  these  results  for  a 
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proponent  of  SEU  descriptive  models  is  not  the  presence  of 
biases,  but  the  thought  processes  that  seem  to  underlie  both 
erroneous  and  accurate  probability  assessments.  Tversky  and 
Kahneman  (1974)  have  proposed  that  many  probabilistic  judgments 
are  produced  by  using  rules  of  thumb  or  heuristics  whose  inter¬ 
nal  logic  bears  little  resemblance  to  the  rules  of  probability. 
These  are  generally  effective  ways  of  coping  with  an  uncertain 
environment  that  both  deny  its  probabilistic  character  and 
preclude  learning  about  some  basic  phenomena  (like  regression 
to  the  mean) .  Often  they  lead  to  substantially  biased  judgments 
and  decisions  not  in  the  individual's  best  interest.**  To  the 
extent  that  these  heuristics  capture  people's  thought  processes, 
they  (a)  argue  strongly  against  the  notion  that  we  are  efficient 
probabilistic  functionalists,  well- tuned  to  the  uncertain  struc¬ 
ture  of  our  environment,  and  (b)  even  when  people  do  incorpor¬ 
ate  uncertainty  in  their  decision  making,  they  do  so  in  terms 
quite  different  from  the  formally  defined  probabilities  appear¬ 
ing  in  SEU  models. 

Assessing  Values 

The  study  of  value  judgments  and  their  validity  by  behav¬ 
ioral  decision  theory  researchers  has  languished  relative  to 
the  study  of  probability  judgments.  The  presumed  reason  for 
this  apparent  disinterest  is  the  absence  of  an  acceptable 
criterion  by  which  to  evaluate  value  judgments.  Such  judgments 
would  seem  to  be  the  last  redoubt  of  unaided  intuition.  Who 
knows  better  than  an  individual  what  he  or  she  prefers? 

Recent  research  has,  however,  revived  and  elaborated  a 
lesson  long  known  to  attitude  researchers .  Subtle  changes  in 
how  value  questions  are  phrased  and  responses  are  elicited 
can  have  marked  effects  on  the  preferences  people  express.  This 
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lability  in  value  judgments  seems  to  have  important  implica¬ 
tions  for  both  how  values  are  conceptualized  and  how  they  are 
studied . 

An  article  of  faith  among  students  of  value,  choice  and 
attitude  judgments  is  that  people  have  reasonably  well-defined 
opinions  regarding  the  desirability  of  various  events.  Although 
these  opinions  may  not  be  intuitively  formulated  in  numerical 
(or  even  verbal)  form,  careful  questioning  can  elicit  judgments 
representing  people's  underlying  values.  From  this  stance, 
elicitation  procedures  are  neutral  tools,  bias-free  channels 
which  translate  subjective  feelings  into  scientifically  usable 
expressions.  They  impose  no  views  on  respondents,  beyond 
focusing  attention  on  those  value  issues  of  interest  to  the 
investigator. 

It  is  by  no  means  obvious,  however,  that  people's  values 
are  well  defined.  Observed  test- retest  reliabilities  in  the 
values  of  desired  outcomes  (within  situations)  have  been 
fairly  modest  (r  *  .48,  .60  in  DeLeo  &  Pritchard,  1974;  Lied 
&  Pritchard,  1976)  and  remarkably  little  consistency  in  the 
rated  valence  of  outcomes  across  situations  was  observed  by 
Muchinsky  (1977) .  Lawler  (1971)  found  wide  variations  in  the 
importance  attached  to  one  central  outcome  of  work,  pay,  at 
least  partially  due  to  variations  in  elicitation  procedures. 

What  happens  in  cases  where  people  do  not  know,  or  have 
difficulty  appraising,  what  they  want?  Under  such  circumstan¬ 
ces,  these  procedures  may  become  major  forces  in  shaping  the 
values  expressed,  or  apparently  expressed,  in  the  judgments  they 
elicit.  They  can  induce  random  error  (by  confusing  the  respon¬ 
dent)  ,  systematic  error  (by  hinting  at  what  the  "correct" 
response  is) ,  or  unduly  extreme  judgments  (by  suggesting  clarity 
and  coherence  of  opinion  that  are  not  warranted) .  In  such 


cases,  the  method  becomes  the  message,  a  reflection  of  the  re¬ 
searcher's  tools  rather  than  the  respondent's  wishes. 

People  are  most  likely  to  have  clear  preferences  regarding 
issues  that  are  familiar,  simple,  and  directly  experienced. 7 
Each  of  these  properties  is  associated  with  opportunities  for 
trial-and-error  learning,  particularly  such  learning  as  may  be 
summarized  in  readily  applicable  rules  or  homiles.  Those  rules 
provide  stereotypic,  easily  justifiable  responses  to  future 
questions  of  value.  When  adopted  by  individuals,  they  may  be 
seen  as  habits;  when  adopted  by  groups,  they  constitute  tradi¬ 
tions  . 

The  power  of  these  rules  of  thumb  for  assessing  values 
comes  from  their  development  and  application  to  simple  and  repe¬ 
titive  problems.  Their  viability  becomes  suspect  when  the  issues 
are  unfamiliar  and  complex,  the  old  intuitions  impotent,  the 
old  rules  untested  and  perhaps  untes table.  Unfortunately,  these 
are  precisely  those  situations  in  which  values  are  worth  study¬ 
ing.  In  them,  however,  we  may  never  have  considered  the  impli¬ 
cations  of  the  values  and  beliefs  acquired  in  simpler  settings. 

As  a  result,  we  may  have  no  articulated  preferences.  In  some 
fundamental  sense,  our  values  may  be  incoherent,  not  thought 
through.  In  thinking  about  what  are  acceptable  levels  of  risk, 
for  example,  we  may  be  unfamiliar  with  the  terms  in  which  issues 
are  formulated  (e.g.,  social  discount  rates,  miniscule  probabi¬ 
lities,  or  megadeaths).  We  may  have  contradictory  values  (e.g., 
a  strong  aversion  to  catastrophic  losses  of  life  and  a  realiza¬ 
tion  that  we're  not  more  moved  by  a  plane  crash  with  500  fatal¬ 
ities  than  one  with  300) .  We  may  occupy  different  roles  in  life 
(parents,  workers,  children)  which  produce  clear-cut,  but  incon¬ 
sistent  values.  We  may  vacillate  between  incompatible,  but 
strongly  held,  positions  (e.g.,  freedom  of  speech  is  inviolate, 
but  should  be  denied  to  authoritarian  movements) .  We  may  not 
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even  know  how  to  begin  thinking  about  some  issues  (e.g.,  the 
appropriate  tradeoff  between  the  opportunity  to  dye  one's 
hair  and  a  vague,  minute  increase  in  the  probability  of  cancer  20 
years  from  now) .  Our  views  may  undergo  changes  over  time  (say, 
as  we  near  the  hour  of  decision  or  the  consequence  itself)  and 
we  may  not  know  which  view  should  form  the  basis  of  our  decision. 

Such  inarticulated  preferences  are  hardly  compatible  with 
the  sort  of  rigorous  systematic  thinking  required  by  SEU 
decision  making.  Furthermore,  they  give  pause  for  thought  about 
just  what  subjects  are  giving  us  when  we  elicit  the  values  needed 
to  test  the  conformity  of  their  behavior  with  the  SEU  model. 

Listing  a  few  specific  effects  may  indicate  the  power  an 
elicitor  may  deliberately,  or  inadvertently,  wield  in  shaping, 
distorting  or  even  creating  expressed  preferences.  The  desir¬ 
ability  of  possible  outcomes  is  often  evaluated  in  relation  to 
some  reference  point.  That  point  could  be  one's  current  (asset) 
position,  or  an  expected  level  of  wealth  (what  someone  with  my 
talents  should  be  worth  at  time  t) ,  or  that  possessed  by  another 
person.  Shifts  in  reference  point  are  fairly  easily  effected 
and  can  lead  to  appreciable  shifts  in  judged  desirability, 
even  to  reversals  in  the  order  of  preference.  Consider,  for 
example,  how  one  might  think  about  the  same  safety  program 
conceptualized  in  terms  of  lives  saved  or  lives  lost,  with  the 
respective  reference  points  of  the  current  situation  or  an 
ideal  one.  As  one  gets  closer  to  an  event  with  mixed  conse¬ 
quences,  the  aversiveness  of  its  negative  aspects  may  increase 
more  rapidly  than  the  attractiveness  of  its  positive  aspects, 
making  it  appear,  on  the  whole,  less  desirable  than  it  did  from 
a  distance.  People  may  have  opposite  orders  of  preference 
for  gambles  when  asked  which  they  prefer  (which  focuses  their 
attention  on  how  likely  they  are  to  win)  and  when  asked  how 
much  they  would  pay  to  play  each  (which  highlights  the  amount 
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to  win) .  People  may  prefer  to  take  a  chance  at  losing  a  large 
sum  of  money  rather  than  absorb  a  smaller  sure  loss,  but  change 
their  mind  when  the  sure  loss  is  called  an  insurance  premium. 

A  relatively  unimportant  attribute  may  become  the  decisive 
factor  in  choosing  between  a  set  of  options  if  they  are  pre¬ 
sented  in  such  a  way  that  that  attribute  affords  the  easiest 
comparison  between  them. 

Three  important  features  of  these  shifting  judgments  are: 
(a)  people  are  typically  unaware  of  the  potency  of  such  shifts 
in  their  perspective,  (b)  they  often  have  no  guidelines  as  to 
which  perspective  is  the  appropriate  one,  and  (c)  even  when 
there  are  guidelines ,  people  may  not  want  to  give  up  their  own 
inconsistency  (Fischhoff ,  Slovic  &  Lichtenstein,  1978,  1980) . 

Elicitors  must  decide  at  some  point  whether  or  not  they 
have  adequately  captured  their  respondent's  values.  The  usual 
criteria  are  reliability  and  internal  consistency  (e.g., 
transitivity) .  However,  where  the  task  is  poorly  understood 
because  of  complexity  or  unfamiliarity,  consistency  of  response 
within  a  given  elicitation  mode  may  tell  us  little  beyond  the 
power  of  that  mode  to  impose  a  particular  perspective.  Consis¬ 
tency  of  response  is  a  necessary  but  not  sufficient  condition 
for  coherence  of  the  underlying  values.  Greater  insight  into 
values  may  come  from  posing  diverse  questions  in  the  hope  of 
eliciting  inconsistent  responses.  Therefore,  one  would  want 
to  start  the  study  of  values  with  methodological  pluralism 
and  treat  inconsistency  in  expressed  values  as  a  success  rather 
than  a  failure  of  measurement,  for  it  indicates  contexts  defined 
sharply  enough  to  produce  a  difference. 

If  one  is  interested  in  how  people  express  their  values  in 
the  real  world,  one  question  may  be  enough.  That  world  often 
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asks  only  one  question  (e.g.,  in  a  ballot  measure) .  A  care¬ 
ful  analysis  of  how  an  issue  is  posed  may  allow  one  to  identify 
that  question  and  accurately  predict  responses. 


If  one  is  interested  in  what  people  really  feel  about  a 
value  issue,  there  may  be  no  substitute  for  an  interactive, 
dialectical  elicitation  procedure,  one  that  acknowledges  the 
elicitor's  role  in  helping  the  respondent  to  create  and  enunci¬ 
ate  values.  That  help  would  include  a  conceptual  analysis  of 
the  problem  and  of  the  personal,  social,  ethical  value  issues 

O 

to  which  the  respondent  might  wish  to  relate. 

DECISION  CONTEXTS 


While  behavioral  decision  theorists  have  not  developed  a 
theory  of  decision  situations  to  complement  their  evolving 
theory  of  the  individual  decision  maker,  they  have  upon  occa¬ 
sion  attempted  to  replicate  their  laboratory  studies  in  the 
real  world.  These  experiments  at  roughing  it  have  been  prompted 
by  (a)  the  feeling  that  decision  theory  like  other  areas  in 
applied  cognitive  psychology  should  work  in  the  world;  (b)  the 
refusal  of  economists  to  accept  (or  even  look  at)  laboratory 
data;  (c)  the  hope  of  being  able  to  elaborate  their  theories 
by  embedding  them  in  a  broader  context  and  (d)  continued  inter¬ 
action  with  decision  analysts,  purveyors  of  SEU  as  a  normative 
guide  to  decision  making  who  are  particularly  attuned  to  the 
subjective  quality  of  judgments  of  fact  and  value. 

One  obvious  step  in  the  direction  of  realism  is  to  use 
real  rather  than  hypothetical  stakes  in  studies  of  decision 
making.  A  second  is  to  use  experts  rather  than  naive  subjects. 
Although  available  evidence  is  modest,  neither  manupulation 
has  so  far  provided  sufficiently  dramatic  results  to  cast  ser¬ 
ious  doubt  on  most  of  the  research  conclusions  cited  above 
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(Slovic,  Fischhoff,  Lichtenstein,  1979;  Tversky  &  Kahneman, 

1974)  .  A  third  obvious  step,  using  more  realistic  stimuli, 
has,  it  seems, only  strengthened  conclusions  regarding  the  labil¬ 
ity  of  judgments  of  value  and  liabilities  of  judgments  of  fact 
(Lichtenstein  et  al.,  1978;  Fischhoff,  Slovic  &  Lichtenstein, 
1978)  . 

None  of  these  manipulations,  however,  leads  to  the  study 
of  actual  judgments  in  situ.  Probably  the  most  elaborate  ex¬ 
cursion  of  behavioral  decision  theory  into  the  real  world  has 
been  a  study  of  disaster  insurance  protection  by  Kunreuther  et 
al.  (1978) .  Despite  massive  investments  in  flood  control  pro¬ 
jects,  flood  damage  in  the  U.S.  continues  to  climb.  The  main 
reason  seems  to  be  the  overdevelopment  of  flood  plains  caused 
in  part  by  the  flood  control  projects  themselves.  By  elimina¬ 
ting  frequent  minor  floods,  these  projects  have  unduly  reduced 
residents'  feeling  of  flood  danger.  As  a  result,  when  a  suf¬ 
ficiently  large  rain  or  thaw  somes  along  and  overwhelms  the 
project,  much  more  property  is  exposed.  Thus  floods  are  less 
frequent,  but  the  loss  from  each  is  much  greater  than  prior  to 
the  project.  In  order  to  reduce  losses,  the  U.S.  Congress 
enacted  a  flood  insurance  program  which  mandated  land-use 
planning  as  the  price  for  making  insurance  available  at  highly 
attractive  rates.  "Highly  attractive"  was  determined  by  econo¬ 
mists  who  assumed  that  residents  shared  the  risk  and  cost  data 
in  the  hands  of  the  planners  and  combined  them  according  to  SEU 
principles.  Unfortunately,  almost  no  one  bought  the  insurance. 

A  national  survey  of  flood-prone  areas  designed  to  under¬ 
stand  the  failure  of  the  program  discovered  that  residents' 
judgments  of  risks  and  costs  were  very  different  than  those 
of  the  experts.  Often  these  mis  judgments  seemed  to  have  the 
same  roots  as  the  judgmental  biases  observed  in  laboratory 
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studies.  Obviously,  even  if  residents  were  SEU  decision  makers, 
they  could  not  be  expected  to  behave  in  the  predicted  way  (i.e., 
purchase  insurance)  if  they  did  not  share  the  planners'  infor¬ 
mation  base.  The  design  of  Kunreuther's  study  allowed  an  ap¬ 
praisal  of  how  favorable  insurance  should  be  to  these  indivi¬ 
duals  given  what  they  thought  to  be  the  facts  of  the  matter 
and  assuming  that  they  were  SEU  decision  makers.  A  straight¬ 
forward  derivation  translated  attractiveness  of  insurance  into 
an  index  K,  with  higher  K  values  being  associted  with  more  at¬ 
tractive  insurance  situations.  Comparison  of  individuals  who 
had  and  had  not  purchased  insurance  revealed  no  difference  at 
all  in  the  distribution  of  K  values.  Thus  whatever  guided 
insurance  purchase  decisions,  it  was  not  an  SEU  analysis  based 
on  either  the  "real"  (experts')  or  "imagined"  (residents') 
view  of  the  situation. 

What  did  guide  decisions?  Kunreuther's  best  guess  was 
that  people  relied  upon  a  series  of  informal  decision  rules  like 
the  answers  to:  Are  my  neighbors  and  relatives  buying  it? 

What  does  my  insurance  agent  say?  Do  I  have  the  capacity  to 
worry  about  one  more  thing? 

The  application  of  E-V  theory  to  predicting  effort  and  per¬ 
formance  in  the  work  place  may  be  considered  a  prolonged  exten¬ 
sion  of  SEU  theory  into  the  real  world.  In  an  extensive  review 
of  this  effort,  Schwab  et  al.  (1979)  found  that  on  the  average, 
only  9%  of  the  variance  in  effort  or  performance  is  explained. 

Granting  that  their  review  considered  only  between-sub ject  appli¬ 
cations  of  this  within-person  behavioral  choice  theory  (thereby 
increasing  the  amount  of  error  variance) ,  analogous  within-per¬ 
son  studies  rarely  explain  more  than  25%  of  the  variance.  Fur¬ 
thermore,  with  one  exception,  the  few  factors  that  Schwab  and 
his  colleagues  identify  as  contributing  to  better  prediction 
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are  either  not  those  prescribed  by  E-V  models,  or  antitheti¬ 
cal  to  them.  The  authors  conclude  by  noting  that: 

Despite  these  qualifications,  there  is  a  nagging 
suspicion  that  expectancy  theory  overintellectualizes 
the  cognitive  processes  people  go  through  when  choosing 
alternative  actions  (at  least  insofar  as  choosing  a  level 
of  performance  or  effort  is  concerned) .  The  results  of 
the  present  review  are  consistent  with  this  suspicion 
(p.  146)  . 

More  pessimistically,  Staw  (1977)  suggests  that  ordinary  choice 
behavior  is  better  modeled  as  "impulse  buying"  decisions  of 
consumers  than  on  the  lines  of  SEU  theory. 

Perhaps  more  revealing  than  the  contexts  to  which  research' 
ers  have  attempted  to  extend  laboratory  results  are  the  con¬ 
texts  that  they  have  avoided.  Situations  to  which  they  have 
assumed  that  results  will  not  generalize  may  provide  an  impli¬ 
cit  partial  theory  of  the  environment.  One  characteristic  of 
most  behavioral  decision  theory  experiments  is  that  every 
attempt  is  made  to  eliminate  any  temptation  for  strategic 
responding.  Rather,  subjects  are  rewarded  for  responding  as 
honestly  as  they  can.  However,  there  obviously  are  situations 
where  it  may  pay  to  lie  (e.g.,  when  the  boss  asks  me  about 
my  work  environment)  and  they  may  engender  rather  different 
reporting  behavior.  A  second  characteristic  is  the  absence  of 
time  and  emotional  pressure  creating  the  sort  of  "hot  cogni¬ 
tion"  studied  by  Janis  and  Mann (19 77) .  A  third  characteris¬ 
tic  is  isolation  of  the  individual  from  social  interactions, 
which  may  deprive  the  individual  of  needed  decision-making  aids 
(i.e.,  talking  to  others)  and  foster  individualism.  A  fourth 
characteristic  of  most  studies  is  the  preclusion  of  a  no-choice 
or  procrastination  option  (Corbin,  1980) . 


Each  of  these  characteristics  might  in  principle  represent 


an  environmental  variable  restricting  the  validity  of  behav¬ 
ioral  decision  theory  results.  Or  each  might  provoke  minor 
perturbations  like  the  use  of  real  stakes  or  the  shift  from 
naive  to  expert  subjects. 


CONCLUSION 


The  story  of  SEU  research  has  in  some  senses  been  a  tale 
of  deadends  and  hard-earned  lessons.  An  enormous  amount  of 
effort  was  devoted  to  "capturing"  decision  strategies  before 
it  was  realized  that  the  power  of  linear  models  virtually  pre¬ 
cluded  learning  what  weights  individuals  were  giving  to  dif¬ 
ferent  attributes  or  if  they  were  using  an  SEU  decision  rule 
at  all.  Many  studies  of  how  people  estimate  relative  fre¬ 
quencies  were  conducted  before  researchers  realized  that  what¬ 
ever  their  intrinsic  interest  such  tasks  were  not  particularly 
relevant  to  the  sort  of  uncertainty  in  most  decision  situa¬ 
tions.  For  years,  researchers  derived  satisfaction  from 
the  elicitation  of  consistent,  reliable  value  judgments, 
before  beginning  to  worry  that  such  orderliness  was  a  product 
of  their  methods.  Being  rational  individuals,  decision  scien¬ 
tists  have  repeatedly  assumed  that  the  inherent  reasonableness 
of  the  SEU  model  would  make  it  an  acceptable  normative  guide  to 
decision  making.  Those  who  doubted  the  model  have  naively 
felt  that  showing  spot  inconsistencies  or  rejection  of  axioms 
would  cause  the  model  to  keel  over. 

Having  disabused  ourselves  of  these  particular  illusions, 
where  do  we  go  from  here?  Obviously,  we  should  have  better 
respect  for  the  power  of  our  methodology  to  produce  orderly 
but  misleading  results.  Substantively,  we  should  begin  to 
develop  an  understanding  of  decision  environments  which  will 
enable  us  to  understand  how  general  laboratory  results  are  and 
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how  far  reaching  their  effects  (Ebbesen  &  Konecni,  1980; 

Howell  &  Burnett,  1978) .  One  key  component  of  this  under¬ 
standing  will  be  an  error  theory  explicating  the  implications 
of  poorly  structuring  decisions,  assessing  probabilities  and 
values,  and  combining  the  various  components.  The  sensitivity 
of  our  decisions  and  their  consequences  to  deviations  from 
optimality  will  clarify  which  principles  of  decision  making 
need  to  be  and  can  be  learned  from  experience  (Einhorn  &  Ho¬ 
garth,  1978;  Fischhoff,  1980a,  b) . 

Theoretically,  we  need  to  go  beyond  simply  trying  to  falsi¬ 
fy  SEU  theory  which,  if  only  because  of  the  power  of  linear 
models,  will  almost  always  provide  at  least  mediocre  predic¬ 
tions.  Instead,  we  need  to  move  on  to  sophisticated  falsifica¬ 
tion  (Lakatos,  1970),  finding  theories  that  do  what  SEU  does 
and  at  least  a  little  more.  The  shape  of  those  theories  might 
be  new  decision  calculi  with  different  primitives  and  combina¬ 
tions  rules,  like  Kahneman  and  Tversky's  Prospect  Theory  (1979); 
or  process-tracing  models  that  attempt  to  treat  not  only  input- 
output  relationships,  but  also  the  intervening  thought  processes 
(Payne,  Braunstein  &  Carroll,  1978;  Svenson,  1979);  or  contin¬ 
gency-based  models  that  preserve  some  of  the  SEU  logic  but  also 
incorporate  a  theory  of  the  environment  and  the  way  in  which 
it  is  sequentially  decomposed  (Beach  &  Mitchell,  1978) ; 
or  inventories  of  the  rules  of  thumb  that  people  use  to  supple¬ 
ment  or  replace  analytical  decision  making  (Kunreuther  et  al., 
1978)  ;  or  predictions  of  how  the  habits  ingrained  from  making 
repetitive  decisions  with  an  opportunity  for  trial-and-error 
learning  lead  one  astray  when  one  must  make  analytic  decisions 
and  get  them  right  the  first  time. 
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FOOTNOTES 

1.  One  source  of  resistance  only  tangentially  relevant  to  our 
story  lies  in  the  disciplinary  prejudices  of  the  economists  and 
(to  a  lesser  extent)  the  philosophers  instrumental  in  launching 
SEU.  The  sophisticated  mathematical  derivations  and  elegant 
symbol  manipulations  which  are  the  stock  in  trade  of  these  two 
groups  require  a  degree  of  formalism  like  that  found  in  SEU. 

The  theory  did  not  fall  over  dead  with  the  inconsistencies 
identified  by  Allais  or  Ellsberg,  for  example,  because  neither 
offered  an  alternative  calculus  upon  which  these  professionals 
could  work  their  magic. 

2.  Functional  measurement  is  one  of  the  few  techniques  designed 
to  do  this  (Anderson,  1978).  While  studies  in  this  vein  have 
produced  useful  insights,  literal  acceptance  of  their  conclu¬ 
sions  requires  some  heady  assumptions,  particularly  regarding 
the  verisimilitude  of  responses  to. involved  factorial  designs 
and  highly  schematic  stimuli.  For  example,  one  must  be  convinced 
that  subjects  do  not  solve  the  problems  presented  by  such  tasks 
by  concocting  situation-specific  decision  rules  that  emerge, 
upon  analysis,  as  highly  systematic  behavior. 

3.  See  Beach  and  Mitchell  (1978)  for  a  predictive  model  using 
transaction  costs,  albeit  not  labeled  as  such,  as  a  major 
determinant  in  the  selection  of  decision-making  strategy. 

4.  Influence  in  the  opposite  direction  is  also  possible.  Prob¬ 
abilities  may  be  influenced  by  utilities,  as  when  optimists  exag¬ 
gerate  the  probability  of  good  things  happening.  There  appears 
at  the  moment  not  to  be  any  hard  evidence  that  such  distortion 
occurs  (Wallsten,  1971) . 

5.  A  resolute  SEU  devotee  could  still  claim  that  the  group  had 
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been  used  to  restructure  the  decision  problem  offered  by 
management  (piecework) .  Adherence  to  group  production  norms 
is  actually  the  result  of  individual  SEU  maximizing  in  the 
context  of  the  new  decision  problem  which  incorporates  group 
action  as  a  new  option,  group  sanctions  and  collectively  attained 
pay  schemes  as  new  consequences  and  collective  bargaining  power 
as  a  new  fact  of  life  affecting  the  probabilities  of  all  conse¬ 
quences  . 

6.  While  Kahneman  and  Tversky's  research  has  come  to  be  known 
for  its  identification  of  errors,  they  make  no  statement  about 
how  bad  judgment  is  in  general.  They  focus  on  errors  because 
there  are  fewer  ways  to  explain  a  pattern  of  errors  than  a 
pattern  of  success  and  because  suboptimal  behavior  in  conditions 
encouraging  optimality  suggests  deep-seated  cognitive  tendencies. 

7.  For  example.  Ash,  Levine  &  Edgell  (1979)  found  that  the 
reliability  of  the  rated  desirability  of  attributes  increases 
with  respondents'  direct  experience  with  the  job. 

8.  As  an  example  of  the  power  of  the  interactive  approach  to 
assessing  values,  Matsui  and  Ikeda  (1976)  and  Rosenberg  (1956) 
were  better  able  to  predict  people's  decisions  when  using  out¬ 
comes  generated  on  the  spot  by  subjects  than  with  a  more 
comprehensive  list  of  outcomes  that  they,  as  experimenters, 
provided . 
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